METHOD AND SYSTEM FOR PERFORMING ASYNCHRONOUS 
PREFORMATTING IN A DATABASE SYSTEM 



FIELD OF THE INVENTION 

The present invention relates to computer systems, and more particularly to a method 
and system for performing formatting asynchronously in a database system. 

BACKGROUND OF THE INVENTION 

Conventional database management systems are used for a variety of applications. 
Figure 1 depicts a portion of a computer system 10 on which the conventional database 
system operates. The computer system 10 is typically platform, such as a server. The 
computer system 10 a processor 12, includes a storage device, such as a disk 14, and an 
input/output device 16, such as a keyboard and display, which may be coupled to a 
workstation. In order to operate the conventional database system, a portion of the disk 14 is 
formatted to allow data input to the database system to be stored. This formatting is specific 
to the conventional database system desired to be used. In general, each conventional 
database system utilizes a particular format for storing and accessing data. The computer 
system 10 might support a variety of conventional database systems, as well as other 
applications. Thus, each time a particular conventional database system is to be used to store 
data, some portion of the disk 14 is formatted for that particular conventional database 
system. The formatting might include overwriting a portion of the disk as well as providing 
control information to allow the conventional database system to access the stored data. 
Typically, only a portion of the disk 14 is formatted because other applications might use 
remaining portions of the disk 14 to store data. Furthermore, formatting for a particular 
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conventional database system is performed by the conventional database system only during 
operation of the conventional database system. 

Figure 2 depicts a conventional method 20 for formatting a storage device, such as 
the disk 14, for use by a conventional database system. Thus, the conventional method 20 
may be considered to be part of and carried out by a conventional database system. The 
conventional method 20 is described in conjunction with the computer system 10. Referring 
to Figures 1 and 2, the conventional method 20 commences after a new file for the 
conventional database system is created. For example, the conventional method 20 may 
begin when a use opens a new table for storing data. A first element of the disk drive is 
formatted by the conventional database management system, via step 22. The size of the 
element in step 22 may depend upon the particular conventional database system. The actual 
formatting in step 22 may include using a mask to write a specific pattern in the element as 
well as providing control information, for example in a header. Input from and output to a 
user, as well as storage of data on the disk 14 are carried out, via step 24. It is determined 
whether data is to be stored such that the end of the current (first) increment is reached, via 
step 26. If not, then operation continues in step 24. However, if the end of the increment has 
been reached, then a new element is formatted for use by the conventional database system, 
via step 28. Any data that was to be stored but could not because the end of the first element 
was reached is then stored in the new element, via step 30. Thus, access to the disk 14 by the 
database system is stalled during formatting of the next increment in step 30. The method 
then returns to normal operations in step 24. The method 20 is typically repeated until the 
file for the database system is closed. 

Thus, the conventional method 20 allows the conventional database system to format 
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increments and store data on a storage device such as the disk 14, one of ordinary skill in the 
art will readily realize that the conventional method 20 results in delays to the user. The 
conventional method 20 formats a new increment only when a current increment is filled. 
Thus, if the data being stored would exceed space available in the current increment, then 
the storage operation is delayed until the subsequent increment is formatted. Once the 
subsequent increment is formatted, the data may be stored and operation can continue. For 
large amounts of data being stored, this delay may be significant, for example up to several 
seconds. Thus, the user becomes aware of a delay in inputting data to the conventional 
database system. This slowdown in performance of the conventional database management 
system is undesirable. 

Accordingly, what is needed is a system and method for more efficiently formatting 
the storage device. The present invention addresses such a need. 

SUMMARY OF THE INVENTION 

The present invention provides a method and system for formatting space in a 
storage device for a database system. The method and system comprise formatting a first 
increment in the storage device for creating a database file and asynchronously formatting at 
least one subsequent increment in the storage device for the database file in a background of 
the database system. 

According to the system and method disclosed herein, the present invention allows 
space on a disk to be formatted while allowing the user to experience reduced delays in 
processing input requests. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a block diagram of a conventional computer system in which the present 
invention may be utilized. 

Figure 2 is a flow chart depicting a conventional method for formatting a storage 
device for a database system. 

Figure 3 is a high level block diagram of a database system in accordance with the 
present invention. 

Figure 4 is a high level flow chart of one embodiment of a method in accordance 
with the present invention for formatting a storage device of a database system. 

Figure 5 is a flow chart of one embodiment of a method in accordance with the 
present invention for formatting a storage device of a database system. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to an improvement in database systems. The following 
description is presented to enable one of ordinary skill in the art to make and use the 
invention and is provided in the context of a patent application and its requirements. Various 
modifications to the preferred embodiment will be readily apparent to those skilled in the art 
and the generic principles herein may be applied to other embodiments. Thus, the present 
invention is not intended to be limited to the embodiment shown, but is to be accorded the 
widest scope consistent with the principles and features described herein. 

Conventional database management systems are used for storing and performing a 
variety of operations on data. Each database management system typically utilizes a 
particular format for storing data. Thus, each conventional database management system 
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must typically format a portion of a storage device during operation. In order to store data, 
the conventional database management system formats a new increment only once it is 
determined that the present formatted increment is to be filled. During formatting of the new 
increment, access to the storage device by the conventional database system is suspended. 
As a result, a user might experience a delay each time an increment is filled. Thus, 
performance of the conventional database management system is adversely affected. 

The present invention provides a method and system for formatting space in a 
storage device for a database system. The method and system comprise formatting a first 
increment in the storage device for creating a database file and asynchronously formatting at 
least one subsequent increment in the storage device for the database file as a background 
process. 

The present invention will be described in terms of method having particular steps. 
However, one of ordinary skill in the art will readily recognize that this method and system 
will operate effectively for other methods consistent with the present invention having 
additional or different steps. The present invention is described in the context of a particular 
type of storage device, a disk. However, one of ordinary skill in the art will readily recognize 
that the method and system are consistent with the use of another storage device. 
Furthermore, the present invention will be described in terms of particular components in a 
system and particular steps in a method. However, one of ordinary skill in the art will realize 
that certain steps or components have been omitted for clarity. 

To more particularly illustrate the method and system in accordance with the present 
invention, refer now to Figure 3, depicting a high-level block diagram of a database system 
50 in accordance with the present invention. The database system 50 may be used in 
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computer system 10 and is preferably implemented by the processor 12. The database 
system 50 is coupled with the input/output 16 in order to receive and display data. The 
database system 50 is also coupled with the storage device 14 in order to store and retrieve 
data. The database system 50 includes an asynchronous formatting block 52. The 
asynchronous formatting block 52 allows the database system 50 to asynchronously format 
increments during operation of the database system 50. In particular, the asynchronous 
formatting block 52 allows the database system to format increments in the background of 
the database system 50 without waiting for the end of a current increment to be reached. 
Operation of the asynchronous formatting block 52 is more particularly described below in 
the methods 100 and 110. Thus, the access delays associated formatting subsequent 
increments during operation of the database system 50 are reduced. 

Figure 4 depicts a high-level flow chart of one embodiment of a method 100 for 
formatting a storage device for use by a database system. The method 100 is preferably 
included in the database management system 50 and is preferably implemented using the 
asynchronous formatting block 52. The method 100 is described in the context of the 
conventional system 10. The method 100 preferably commences after a new file for the 
database system 50 is opened. For example, the method 100 may begin after a new table is 
opened for use. Referring to Figures 1 and 4, the first increment of the storage device, such 
as the disk drive 14, is formatted, via step 102. The size of the increment preferably depends 
upon the application. However, in a preferred embodiment, each increment formatted has the 
same size. Subsequent increment(s) in the storage device 14 are asynchronously formatted 
concurrently in the background of the database system 50, preferably as a background 
process, via step 104. Formatting the increment in step 104 preferably includes overwriting 
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the increment with all zeroes, which allows the database system 50 to rapidly determine 
whether a particular increment has been utilized for storage. Step 104 might also include 
providing control data in the increment during formatting. Formatting of the subsequent 
increments is considered asynchronous because formatting is not based upon the end of the 
previous increment being reached. In a preferred embodiment, formatting of a subsequent 
increment begins when the previous increment is first used to store data. In alternate 
embodiments, asynchronous formatting could begin when a particular threshold for the 
previous increment is reached or could be performed continuously. 

Using the method 100, increments of a storage device such as a disk drive 14 can be 
formatted asynchronously. As a result, a user need not have to wait because formatting 
commences when an input operation is pending. In addition, because asynchronous 
formatting is performed in the background, normal operation of the database system 50 may 
not be adversely affected. In other words, a user could continue to input data and data could 
be stored in parallel with the formatting performed in step 104 of the method 100. 
Consequently, performance of the database system 50 is improved. 

Figure 5 is a more detailed flow chart of one embodiment of a method 1 10 in 
accordance with the present invention for formatting a storage device for use by a database 
system 50. The method 100 is preferably included in the database management system 50 
and is preferably implemented using the asynchronous formatting block 52. The method 1 10 
is described in the context of the conventional system 10. The method 110 preferably 
commences after a new file for the database system 50 is opened. For example, the method 
110 may begin after a new table is opened for use. Referring to Figures 1 and 5, the first 
increment of the storage device, such as the disk drive 14, is formatted, via step 112. The 
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size of the increment preferably depends upon the application. However, in a preferred 
embodiment, each increment formatted has the same size. Input from and output to a user, as 
well as storage of data on the disk 14 are carried out, via step 1 14. Thus, normal operation of 
the database system 50 is carried out in step 1 14. It is determined whether a particular 
condition relating to formatting is satisfied, via step 1 16. In a preferred embodiment, the 
condition is the first use of the increment currently in use (e.g. the first increment when only 
the first increment has been formatted). In an alternate embodiment, the condition could be a 
particular threshold being reached and/or exceeded. In such an embodiment, the threshold 
might be one-half of the space available in the current increment. The threshold of one-half 
is selected because it is possible that a higher threshold might result in delays where large 
amounts of information are being input to the file for the database. A higher threshold might 
result in delays because the large amounts of information might fill the capacity of the 
current increment prior to the subsequent increment being formatted. 

If the condition has not been fulfilled, then normal operation of the database system 
is continued in step 114. However, if the condition has been fulfilled, then the database 
system is triggered to commence asynchronous formatting of a subsequent increment in the 
background, via step 118. The database system formats the subsequent increment in the 
background, via step 120. Formatting the increment in step 120 preferably includes 
overwriting the increment with all zeroes, which allows the database system 50 to rapidly 
determine whether a particular increment has been utilized for storage. Step 120 might also 
include providing control data in the increment during formatting. Normal operation is also 
carried out in step 1 14. Thus, steps 1 14 and 120 may be performed in parallel. 

Using the method 110, increments of a storage device such as a disk drive 14 can be 
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formatted asynchronously. As a result, a user need not have to wait because formatting 
commences when an input operation is pending. In addition, because asynchronous 
formatting is performed in the background, normal operation of the database system may not 
be adversely affected. Consequently, performance of the database system is improved. 

A method and system has been disclosed for asynchronously formatting files for a 
database system. Software written according to the present invention is to be stored in some 
form of computer-readable medium, such as memory, CD-ROM or transmitted over a network, 
and executed by a processor. Consequently, a computer-readable medium is intended to 
include a computer readable signal which, for example, may be transmitted over a network. 
Although the present invention has been described in accordance with the embodiments 
shown, one of ordinary skill in the art will readily recognize that there could be variations to 
the embodiments and those variations would be within the spirit and scope of the present 
invention. Accordingly, many modifications may be made by one of ordinary skill in the art 
without departing from the spirit and scope of the appended claims. 
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